AITopics | lipschitz constant

Collaborating Authors

lipschitz constant

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Beyond Lipschitz: Data-Driven Robustness via Discrete Modulus of Continuity

Dölz, Jürgen, Multerer, Michael, Palma, Michele

arXiv.org Machine LearningMay-28-2026

Robustness of neural networks is commonly quantified via local or global Lipschitz constants. However, Lipschitz continuity can be overly coarse or overly restrictive as global robustness measure, failing to capture nuanced, data-dependent behavior. We propose a data-driven, architecture-agnostic framework based on the discrete modulus of continuity (DMOC), a non linear generalization of Lipschitz continuity that provides a finer notion of robustness. Unlike many existing approaches, DMOC does not require access to model internals and instead evaluates regularity relative to the data distribution. This shifts the focus from the model to the data, which provide a data-driven baseline of regularity against which the network's robustness is assessed. We establish convergence results for DMOC-induced seminorms with explicit data-driven rates in terms of the separation distance, and introduce a scalable minibatch algorithm that reduces the quadratic cost of exact computation, enabling application to large-scale data sets such as ImageNet. Empirically, DMOC serves as an architecture independent diagnostic: it distinguishes trained from untrained networks, reveals underfitting and overfitting regimes, and yields, as a special case, tight Lipschitz estimates comparable to state-of-the-art method such as ECLipsE and ECLipsE-fast.

artificial intelligence, dmoc, machine learning, (17 more...)

arXiv.org Machine Learning

2605.28729

Country:

North America > Canada (0.69)
Europe (0.68)
North America > United States > California (0.16)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Why Does Agentic Safety Fail to Generalize Across Tasks?

Slutzky, Yonatan, Alexander, Yotam, Slor, Tomer, Nagel, Yoav, Cohen, Nadav

arXiv.org Machine LearningMay-11-2026

AI agents are increasingly deployed in multi-task settings, where the task to perform is specified at test time, and the agent must generalize to unseen tasks. A major concern in such settings is safety: often, an agent must not only execute unseen tasks, but do so while avoiding risks and handling ones that materialize. Empirical evidence suggests that even when the ability to execute generalizes to unseen tasks, the ability to do so safely frequently does not. This paper provides theory and experiments indicating that failures of agentic safety to generalize across tasks are not merely due to limitations of training methods, but reflect an inherent property of safety itself: the relationship between a task and its safe execution is more complex than the relationship between a task and its execution alone. Theoretically, we analyze linear-quadratic control with $H_{\infty}$-robustness, and prove that the mapping from task specification to an optimal controller has higher Lipschitz constant with safety requirements than without, yielding a Lipschitz bound of independent interest. Empirically, we demonstrate our conclusions in simulated quadcopter navigation with a neural network agent and in CRM with an LLM agent. Our findings suggest that current efforts to enhance agentic safety may be insufficient, and point to a need for fundamentally different approaches.

large language model, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2605.06992

Country: North America > United States (0.92)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (0.92)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Reliable Estimation of KLDivergence using a Discriminator in Reproducing Kernel Hilbert Space Supplementary Material

Neural Information Processing SystemsApr-25-2026, 23:05:59 GMT

Organization: This supplementary material is presented in a format parallel to the main paper. The section numbers and titles are consistent with the main paper. But, here we also add one new section: Section 10 where we describe the societal impacts and possible negative impacts of the paper. Similarly, the Theorem numbers are consistent with the main paper, but we also have several additional theorems and lemmas which were not included in the main paper. GAN-type Objective for KLEstimation Let f be a discriminator, f: X IR. Let p(x) and q(x) be two probability density functions defined over the space X.

artificial intelligence, dim, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

50e207ab6946b5d78b377ae0144b9e07-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 21:28:37 GMT

artificial intelligence, machine learning, nullz 0, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.45)

Add feedback

Appendix Impact

Neural Information Processing SystemsApr-25-2026, 16:22:49 GMT

The SC stands for the spectral complexity defined in [4]. We use the empirical estimation of k-variance and Lipschitz constant defined in section 5 to calculate kV-Margin and kV-GN-Margin. B.2 Variance of Empirical Estimation In Table 1, we show the average scores over 4 random sampled subsets. We now show the standard deviation in Table 4. Overall, the standard deviation of the estimation is fairly small, consistent to the observation in Theorem 7.

artificial intelligence, lipchitz, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Measuring Generalization with Optimal Transport

Neural Information Processing SystemsApr-25-2026, 16:22:46 GMT

Understanding the generalization of deep neural networks is one of the most important tasks in deep learning. Although much progress has been made, theoretical error bounds still often behave disparately from empirical observations. In this work, we develop margin-based generalization bounds, where the margins are normalized with optimal transport costs between independent random subsets sampled from the training distribution. In particular, the optimal transport cost can be interpreted as a generalization of variance which captures the structural properties of the learned feature space. Our bounds robustly predict the generalization error, given training data and network parameters, on large scale datasets. Theoretically, we demonstrate that the concentration and separation of features play crucial roles in generalization, supporting empirical results in the literature.

artificial intelligence, generalization, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

2bd7f907b7f5b6bbd91822c0c7b835f6-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 06:52:24 GMT

artificial intelligence, information management, machine learning, (20 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
Oceania > Australia (0.28)
North America > United States > California (0.28)

Genre: Research Report (0.93)

Industry: Information Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Information Management (0.68)

Add feedback

1102a326d5f7c9e04fc3c89d0ede88c9-Supplemental.pdf

Neural Information Processing SystemsApr-24-2026, 18:28:09 GMT

This is the distribution over datasets one obtains by first sampling a task t from Pt, and then sampling a dataset S from Pmz|t. Here p(S) corresponds to the marginal distribution over datasets S. Note that the last line above holds because E P f(,S) does not depend on t. Thus, in this section, we present a specialization of the bound for Gaussian distributions. Let P have mean µ and covariance; thus P = N(µ,) and analogously P,0 = N(µ0, 0). We can then apply the analytical form for the KL-divergence between two multivariate Gaussian distributions to the bound presented in Theorem 3. The result is the following bound holding under the same assumptions as Theorem 3: L(P,Pt) 1 l We implement the above bound in code instead of the non-specialized form of the KL divergence to speed up computations and simplify gradient computations. A.3.2 Few-Shot Learning Bound with Validation Data In this section, we will assume that, in addition to the training data S Pmz|t, we have access to validation data Sva Pnz|t at meta-training time. We will show that a meta-learning generalization bound can still be obtained in this case.

adaptation step, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Industry: Education (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

Filters

Collaborating Authors

lipschitz constant

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Beyond Lipschitz: Data-Driven Robustness via Discrete Modulus of Continuity

Why Does Agentic Safety Fail to Generalize Across Tasks?

Reliable Estimation of KLDivergence using a Discriminator in Reproducing Kernel Hilbert Space Supplementary Material

50e207ab6946b5d78b377ae0144b9e07-Supplemental.pdf

Appendix Impact

Measuring Generalization with Optimal Transport

3cec07e9ba5f5bb252d13f5f431e4bbb-Paper.pdf

2bd7f907b7f5b6bbd91822c0c7b835f6-Paper.pdf

2c19666cbb2c14d45d39e2dcf6ab0b99-Paper-Conference.pdf

1102a326d5f7c9e04fc3c89d0ede88c9-Supplemental.pdf